Efficient Extraction for Mobile Web Access Log with Caching Strategy
نویسندگان
چکیده
Mobile web access log file plays an important role in the analysis about demand of mobile terminal market or user behavior. However, the log file data is highly dimensional, disorganized and semi-structured, which heightens the difficulty of data extracting accuracy; while it generates and transmits continuously, which poses an extracting efficiency challenge. It is highly desirable to extract the information embedded in log files as disorder or hidden situation efficiently and accurately. This paper proposes an efficient extracting method for mobile web access log with cache strategy. Firstly of all, data dictionary sets are built for each kind of complex field before extracting. Then, data is extracted based on the dictionaries, and the dictionaries will be completed simultaneously. Furthermore, with the discovery of the distribution of some data in the log generally following Zipf-like distribution, cache strategy is considered to be an auxiliary way to reduce mapping time. In addition, the classical cache strategy LFU is chosen. Ultimately, the experiment shows that the data could be extracted from the log accurately, and the extracting efficiency speeds up remarkably with LFU cache strategy.
منابع مشابه
A Novel Caching Strategy in Video-on-Demand (VoD) Peer-to-Peer (P2P) Networks Based on Complex Network Theory
The popularity of video-on-demand (VoD) streaming has grown dramatically over the World Wide Web. Most users in VoD P2P networks have to wait a long time in order to access their requesting videos. Therefore, reducing waiting time to access videos is the main challenge for VoD P2P networks. In this paper, we propose a novel algorithm for caching video based on peers' priority and video's popula...
متن کاملA Novel Caching Strategy in Video-on-Demand (VoD) Peer-to-Peer (P2P) Networks Based on Complex Network Theory
The popularity of video-on-demand (VoD) streaming has grown dramatically over the World Wide Web. Most users in VoD P2P networks have to wait a long time in order to access their requesting videos. Therefore, reducing waiting time to access videos is the main challenge for VoD P2P networks. In this paper, we propose a novel algorithm for caching video based on peers' priority and video's popula...
متن کاملMining Web Logs with PLSA Based Prediction Model to Improve Web Caching Performance
Web caching is a well-known strategy for improving the performance of web systems. The key to better web caching performance is an efficient replacing policy that keeps in the cache popular documents and replaces rarely used ones. When coupled with web log mining, the replacing policy can more accurately decide which documents should be cached. In this paper, we present a PLSA based prediction ...
متن کاملEfficient Proxy Server Caching Using Web Usage Mining Technique on Web Logs - for Improving Hit Rate and Response Time
This paper presents a vertical application of web usage mining: efficient web caching for improving the response time , for the internet users ,specially due to increase in number of users of e-commerce on the internet Introducing efficient web caching algorithms that employ predictive models of web requests; the general idea is to extend the cache replacement policies of proxy servers by makin...
متن کاملData Mining for Intelligent Web Caching
The paper presents a vertical application of data warehousing and data mining technology: intelligent web caching. We introduce several ways to construct intelligent web caching algorithms that employ predictive models of web requests; the general idea is to extend the LRU policy of web and proxy servers by making it sensible to web access models extracted from web log data using data mining te...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JCP
دوره 11 شماره
صفحات -
تاریخ انتشار 2016